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Parallel DNA sequoicing methods 



The presrait invention relates to a method of sequencing a target polynucleotide, a 
metibiod of comparing polynucleotide sequences and a mettiod of resolving 
ambiguities in a polynucleotide sequence. 

The currait methods of sequencing nucleotides are botii time consuming and 
expensive. Sequencing tiie human genome for the first time took more than ten years 
arid hundreds of miUions of dollars. If re-sequencing of genomes and de novo 
sequencing of pathogens and model organisms could be performed several ordcars of 
magnitude faster and more cheaply it would open up new avenues for disease genetics 
and fimctional genomics. 

Sanger dideoxy sequencing which provides sequence information rather indirectiy, by 
looking at the differences in gel-migration of a ladder of terminated extension 
reactions provided the means to sequence the consensus human genome. Now, 
however, tiie need for large scale re-sequencing of individual human genomes and de 
novo sequencing and re-sequencing in pathogens and model organisms require 
cheaper and faster alternatives to be developed. 

Several methods that would avoid gel electrophoresis, cloning or die Polymerase- 
chain reaction (PGR) have been suggested. One method, "sequencing by syntiiesis" 
(SbS), involves the identification of each nucleotide immediately following its 
incorporation by a polymerase into an extending DNA strand. One SbS approach, 
pyrosequencing, is widely used for SNP (single-nucleotide polymorphism) typing. In 
' tiiis case, the detection is based on pyrophosphate (PPi) release, it's conversion to 
ATP, and tiie production of visible Ught by firefly luciferase. However, because the 
signal is diffusible, pyrosequencing cannot take advantage of the massive degree of 
paralleUsm that becomes available when surface immobiUsed reactions are analysed. 



WO96/27025 discloses a sequencing by syntiiesis strategy, which involves a 
removable blocking group. In this method the nucleotide which is incorporated is 



modified so that is has a blocking group, which prevents the addition of further 
nucleotides, and a label. Orice the incorporated nucleotide has been identified the 

bloclmg.s^upioemo __ 

downside of this approach is that the chemistry used to remove the blocking group 
can damage the DNA and it is difficult to ensure that the deblockmg reaction goes to 
completion which means that sequence acquisition on individual molecules can 
dephase firom each other. 

Another strategy removes an entire base rather than just a blocking group. 
US2003013101 discloses a method of removing a base in its entirety, using the 3' to ' 
5 exonuclease activity of a DNA polymerase. The use of enzymes is gentle on the 
J3NA^dj©actions-can-bastoichiometric,-How-ever-,-t^^ — 
nucleotides than are added during a specific synthesis cycle, potentially causing errors 
and repetitions in the sequence. 

The present invention provides new sequencing by synthesis methods to overcome the 
shortcomings of the methods disclosed in the prior art. 

In one aspect the present invention provides a method of sequencing a target 
polynucleotide comprising the steps of: 

(a) Carrying out template derived nucleotide synthesis utilising a labelled 
nucleotide; 

(b) detecting the presence or absence of said labelled nucleotide; 

(c) replacing said labelled nucleotide with an unlabelled nucleotide; and 

(d) repeating steps a) to c); 

v/ith the proviso cliat if said.labelled nucleotide iz labelled \-/iich a label directly 



"Template derived polynucleotide synthesis" as used herein means forming a 
polynucleotide molecule utilising a polymerizing reagent that specifically 
mcorporates nucleotides consistent with the well.know Watson Crick base pairing 
rules, using the target nucleotide sequence as a template. The incorporation may be of 
nucleotide analogues nucleotide mimics or other molecules which can be templated 
by a polynucleotide and in which pairing is by well defined rules (Eckardt et al 2002; 
Czlapinski. et al, 2001). For example, high-fideUty templating of DNA base shape 
mimics without forming Watson Crick Bonds has been reported (Delaney et al, 2003). 
Vice versa the template may be any inolecule which can template polynucleotide 
synthesis. 

Polymerizing reagents include DNA polymerases, RNA polymerases, RNA 
transcriptases, reverse transcriptases, or Ugases, as well as chemical reagents tiiat 
enable template directed polymerization. As used herein "polymerising reagent" also 
includes molecules or complexes tiiat are capable of enfordng high fidelity base 
pairing according to well defined rules, regardless of whether they catalyse the 
addition of a single nucleotide. They can be natural, such as tiiose Usted above and 
ribozymes, or ariifical such as abzymes . The polymerizing reagent may comprise one 
or more chemical reagents. For example, template directed Ugation can be mediated 
by chemical reactions ( Xu et al, 2001; G. von Kiedrowski, 1986). 

The target polynucleotide and synthesised polynucleotide can each independentiy be 
strands of RNA or DNA. The DNA can be genomic DNA, or cDNA. The RNA can 
be mRNA, or genomic RNA, such as tiiat from a virus. Alternatively the target 
polynucleotide and/or syntiiesised polynucleotide can have an amide backbone 
formed tiirough peptide nucleic adds (PNA) or a ribose P backbone, as formed by 
DNA. 

The syntiiesis process can involve annealing a primer to tiie template polynucleotide. 
The primer can then be extended by template derived syntiiesis. The primer consists 
of 5-100 nucleotides, preferably 10-75, 15-65, 20 - 55, 25 - 50, or 30 - 45 
nucleotides. The primer may be labelled. The primer may be made and then 
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hybridised to the target polynucleotide. Alternatively nicks can be made in double 
stranded molecules using for example, Deoxyribonuclease 1 (DNAse 1) optimised so 

^diat the distance between-each.niclcis f easona^y-defmed. -T^ intact-strand is the- — 

target polynucleotide to which a series of primers are annealed. Synttiesis, and thus 
5 sequencing can start at each nick site and the non-template strand become 

progressively displaced. Whether a nick seeds displacement synthesis in a sense or 
antisense strand is revealed by the direction of migration of the sequencing signal. 
The template polynucleotide is preferably attached to a solid surface. 

10 The template can be attached indirectly to a surface, via a polymerizing reagent which 
is attached to the surface or it can be captured by a capture probe/primer. The capture 
may be of a sin^e stranded target or a cohesive termini or "sticky end " of a double 
stranded template. 

15 RNA Promoters native to the template DNA can be used for RNA synthesis by RNA 
polymerase. Alternatively, extrinsic promoters for specific polymerases can be 
incorporated by being part of a capture probe or by transposon directed integration 
into sites along the polynucleotide. T7 and T3 RNA polymerase promoters are 
preferred extrinsic promoters. 

20 

The term "labelled nucleotide" as used herein means any of the standard 
deoxyribonucleotides, or ribonucleotides which is attached to a label. Alternatively 
the nucletoides include any modified nucleotides or variations with other bases which 
pair with other bases according to defined rules, such as the Watson-Crick base pair 
25 rules. Thus the labelled nucleotidexan be a labelled peptide nucleotide capable of 
fQrrrPTijT VlIIl^ 



sequence information. The random portion stabiHses the interaction with the template 
and provides sites for attachment of the label. 

The label can be radioactive (such as ^^P), or more preferably a fluorescent tag. The 
fluorescent tag may be a dye molecule such as a fluorphore, for example Cy3 or Cy5 
(Amersham, UK), ROX (carboxy -x- rhodamine), TAMElA(tetramethybhodamine). 
Oregon Green ®, Vistra Green ™, Huorescein, PicoGreen ® , BODIFPY ® and Texas 
Red ®. Such flurophones are commercially available, for example, from Atto-tech 
(Germany) or Molecular Probes CUSA) (Kricka U.). Alternatively the label can be a 
tag which can be identified due to it's physiochemical properties, eg electrophoretic 
mobility or an electric charge. Alternatively a raman signal can be detected, for 
example Surface enhanced raman scattering (SERRs) (Kneipp 1999; Zander 2002). 

The label can be attached directiy through a covalent bond to liie nucleotide, or via a 
linkage. The Unkage preferably comprises a cleavable bond, for example a 
photocleavable bond, or a bond which is cleavable by a mild reducmg agent, such as a 
disulfide bridge. 

The processing of sequencing relies on the base pairing that occurs between 
nucleotides to form a double stranded polynucleotide molecule, according to tiie 
Watson-Crick base paring rules. At each position in a nucleotide molecule, one of tiie 
four ftucleotides can be incorporated. The nucleotide incorporated into the extending 
primer or into an RNA copy is normally the correct base that pairs with tiie base in die 
target polynucleotide. 

. The sequencing metiiod can be carried out in two ways. The four nucleotides can be 
labelled witii tiie same label e.g. one fluorophore. The primer/template 
polynucleotides can be contacted witii one nucleotide (e.g. adenine). The 
unincorporated nucleotides can tiien be removed either by being washed way or 
degraded by an enzyme such as apyrase. Any nucleotides tiiat have been incorporated 
can tiien be detected. This process can tiien be repeated witii tiie otiier tiiree 
nucleotides (e.g. tiiymine. cytosine and guanine). Alternatively tiie four nucleotides 



can be differentially labelled i.e. each has a different label or fluorophore. In this case 
the primer and template poljoiucleotides are contacted with two or more of the 
labeUednucleptides-at;the-same:tiine.-A^ 

incorporated bases are detected. The use of four differentially labelled nucleotides 
allows real time monitoring of the synthesis process. 

In one embodiment the label can be "directly attached" to the nucleotide via a 
covalent bond to the base, sugar moiety or alpha-phosphate. 

Thus, in another aspect the present invention provides a method of sequencing a target 
polynucleotide comprising the steps of: . 

(a) extending a primer annealed to said target polynucleotide utilising a labelled 
nucleotide wherein the label is directiy attached to the nucleotide; 

(b) detecting the presence or absence of said labelled nucleotide within said 
extended primer; 

(c) removal of said labelled nucleotide, and replacement of said labelled 
nucleotide with a degradation resistant nucleotide; and 

(d) repeating steps a-c; 

wherein the 3' end of said primer comprises at least one degradation resistant 
nucleotide. 

When the nucleotide is attached through a direct covalent bond to the label, then the 
replacement of the labelled nucleotide with an unlabelled nucleotide comprises 
removing the entire labelled nucleotide, and replacing it with a degradation resistant 
unlabelled nucleotide. *T)egradation resistant" nucleotides are nucleotides which are 

nQtJ3£mDired_froni the nxntheciced niicleodde izequence by degradation a^zntz. 
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In one embodiment this is preferably done by removing nucleotides ftom the 3' end. 
Such agents include exonucleases, such as exonuclease HI and includes DNA 
polymerases which possess 3' - 5' exoiiuclease activity. These enzymes include T4 
polymerase, and Kcoli DNA polymerase I (DNAPI). Preferably the degradation 
5 resistant nucleotides are alpha -s nucleotides. They may also be methylphosphonate 

Unkages. Also for example boranophosphate modification at the alpha-phosphate 
group in 2'-deoxycytidine 5'-triphosphate (dCTP) (He et al 1999). Degradation 
reagents may also be chemical reagents such as mUd reducing agent or mild acid. 
Physical degradation reagents include ultra-violet light for cleaving photocieavable 
10 bonds. 

This embodiment is sunilar to that disclosed in US2003013101 which uses 
exonuclease activity to remove a nucleotide m its entirety, but in the present invention 
by using degradation resistant nucleotides the removal of more nucleotides than the 

15 last one added in a specific synthesis cycle is prevented. The incorporation of die 

degradation resistant nucleotide shifts the sequence register to the next position for the 
next cycle. The scheme also begins with primers that have degradation resistant 
nucleotides at their 3' end or when synthesis is initiated from a nick in double 
stranded DNA the first addition is of a degradation resistant nucleotide rather than a 

20 labelled nucleotide. 

In another embodiment, one which can be carried out during either RNA (Gueroui 
2002) or DNA synthesis, the degradation reagent cleaves an internal intemucleoside 
bond. In one such preferred embodiment, foUowing detection, a labelled degradation 
25 labile nucleotide is replaced by a degradation resistant nucleotide in order to shift the 

register to the next position in the sequence. In this case the degradation labUe reagent 
may be a nucleotide modified at the 5' position with NH2 (Wolfe 2003; Shchepinov 
2001 and the degradation resistant nucleotide can be a a normal nucleotide. NH2 
modified nucleotide analogs can be efficiently incorporated into DNA by the Klenow 
30 firagment of Escherichia coU DNA polymerase. Mild acid treatment of the resulting 
DNA specificaUy cleaves the modified intemucleoside bond. This approach can be 
carried out by primer mediated DNA syntiiesis or promoter mediated RNA syntiiesis. 
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Details of synthesis of NH2 nucleotides is provided by Wolfe et al (2003). Thb 
nucleotides can be labelled by standard methods (e.g. see Hennanson, GT or Mitra 

^2003) ; „ 

The labelled nucleotides that are incorporated may be dNTPs or ddNTPS. The 
disadvantage of adding dNTPS is there is no absolute certainty as to how many 
fluorescent bases become incorporated at each cycle. The disadvantage of labelled 
ddNTPS is that although only one labelled base will be added, after detection and 
removal of this'base, the nucleotide which replaces it needs to allow incorporation of 
the next fluorescently labelled nucleotide and therefore cannot be blocked at the 3' 
end. However this may allow multiple bases to be incorporated. Therefore it is 
possible that more than one base may be added This would shift the register beyond 
the last base that has been sequenced. This can be overcome by manipulating 
conditions, reaction components and reaction times so that, statistically, there is only 
the chance of incorporating one nucleotide complementary to each single molecule 
template, in one cycle. A pulse-chase reaction can be done in which the nucleotides 
are added for a short burst followed by a chase with apyrase enzyme which degrades 
free nucleotides. Addition of a labelled nucleotide may involve a dijfferent enzyme to 
addition of a degradation resistant nucleotide. Altematively, removable blocking 
groups, as described in any of the prior art can be added to the 3' end of the 
unlabelled nucleotides. These can be removed before addition of the next labelled . . 
base, 

Altematively instead of using blocked fluorescent nucleotides or blocked degradation 
resistant nucleotides the incorporation of more than one nucleotide is allowed to 
occur. Tlie number of fluoreccent bapec added r,m then b f deduced from the 
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Also to prevrait addition of more than one nucleotide at a time both the degradation 
resistant and degradation labile nucleotide jnay be modified or the leactioh configured 
to prevent the addition of more than one nucleotide during one cycle. 

5 

In one preferred embodiment, a blocking group is added to the at the 3' position on 
the degradation labile nucleotide, restricting polymerization to a single base addition. 
The degradation reagent is able to remove die modified degradation labile nucleotide. 
After cleavage of tiiis nucleotide by the degradation reagent, the degradation resistant 

10 nucleotide tiiat replaces it also contains a blocking group at die 3' position, again 
restricting polymerization to a single base addition. This ensures tiiat the sequence 
register is shifted by the required single position only. Altiiough further degradation 
resistant nucleotides are not able to react at this 3' position, the degradation labile 
nucleotide is able to react at this position. Hence the process can continue. For 

15 example the degradation labile nucleotide may have an NH2 group at the 5' position 
which will have different reactivity ttian a 5' phosphate on the degradation resistant 
nucleotide. 

Alternatively, altiiough die degradation resistant nucleotide is not chemically blocked, 
20 the polymerizing reagent used to add the degradation resistant nucleotide may not be 
able to incorporate more than one degradation resistant nucleotides but the 
polymerizing reagent used for addition of tiie next labelled degradation labile 
nucleotide is able to add on to this base. Different enzymes have different 
processivities and diflferent capacities to deal witii nataral and modified DNA 
25 nucleotides. The different steps may utilise different degradation reagents and 

different polymerization reagents. It should be noted that that although a firequent 
changing of polymerizing reagent is expensive, it is justifiable if sequencing is done 
on a large number of molecules in parallel. If tiie reactions are done in microfluidic 
channels the amount of reagents will be small and if a system of valves is 
30 incorporated onto a sequencing chip, tiie reagents which wiU usually be provided in 
excess amounts can be stored in designated chambers on the chip and re-used. 
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In one embodiment the label is attached to the nucleotide by means of a linker which 
comprises a cleavable bond, as described above. In ttie meHiod of the invention, the 

^^.-j:eplacement of- saidlabelled nucleotide-mth-an unlabelled^uclw — 

removal of the label from said nucleotide. 

5 

The cleavable bond can be cleaved following the detection of the presence or absence 
of the labelled nucleotide. The label can be attached in such a way that it blocks the 
incorporation of further nucleotides. This ensures that the only one labelled 
nucleotide is incorporated. 

10 

ITie cleavable bond can be cleaved by means of light (if it is photocleavable) or by 
. using a mild reducing agent if it comprises a disulphide bridge. Mild reducing agents 
include mercaptoethanol and DTT. 

15 Thus in one aspect the present invention provides a method of sequencing a target 
polynucleotide comprising the steps: 

(a) Carrying out template derived nucleotide synthesis utilising a labelled 
nucleotide wherein the label is attached to the nucleotide via a cleavable 
linkage; 

20 (b) detecting the presence or absence of said labelled nucleotide within the 

synthesised polynucleotide; . - , 

(c) cleaving said label jtom said nucleotide; and 

(d) repeating steps a-c. 

25 In one preferred embodiment, the linkage attaching the label to the nucleotide 

camraices a binding pair. One member of the binding pair is linked ro the micleotide 
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therefore complementary to a particular spatial and polar organisation of the other 
member of the pair of molecules. Thus, the members of the pair have the property of 
binding specifically to each other. Examples of types of binding pairs are antigen- . 

antibody, biotin-avidin, hormone-hormone receptor, receptor-ligand, enzyme- 
5 substrate. The use of a linkage comprising a binding pair means that the nucleotide 
added onto the primer may be labelled after it has been incorporated into the primer. 
The nucleotide is attached, preferably via a cleavable Unker to one member of a 
binding pair. The label is attached to the other member of the binding pair. The label 
can then be attached indirecfly to the nucleotide as tiie two members of the binding 
10 pair bind one another. 

Each of the four types of nucleotides can be attached to a different binding pair 
member. The other members of tiie binding pair can be labelled differentially, i.e. 
each is attached to a different fluorophore or nanoparticle. This aUows all of the 
15 nucleotides to be added at the same time. The nucleotide incorporated is tiien labelled 
witii the respective fluorophore via tiie binding pair mechanism. For example adenine 
is attached to biotin, and cytosine is attached to digoxigenin. The fluorophore 
indicating tiie presence of adenine is attached to avidin, and tiiat for cytosine is 
attached to anti-digoxigenin. 

20 

Thus in one aspect the present invention provides a metiiod of sequencing a target 
polynucleotide comprising the steps of; 

(a) carrying out template derived nucleotide syntiiesis using a new nucleotide 
attached by a cleavable linkage to one member of a binding pair; 
25 (b) contacting said nucleotide witii a label attached to tiie otiier member of tiie 

binding pair under conditions such tiiat tiie two members of tiie binding pair 
bind to one another, 

(c) detecting the presence or absence or said label; 

(d) removal of said label and said binding pair by cleaving said cleavable linkage 
30 between the first and second members of the binding pair; and 

(e) repeating steps a-d. 

Alternatively the fluorescent tag may comprise a nanoparticle. 



f 
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The tOTn "nanoparticle" as used herein means an individual particle which has a 
.maxiTOum.to 

use in the present invention are preferably spherical, and/or preferably have a 
5 diameter of 20nm or less. 

The fluorescent nanoparticle can be a particle which has a large number of 
fluorophores embedded within or on its' surface (eg latex particles). Altematively, 
fluorescence emission or modulation may be an inherent property of the particle as is 

10 the case for semiconductor nanocrystals (Quantum Dot Corp., USA; Evident 

Technologies, USA), gold nanoparticles (Nanoprobes Inc., USA), plasmon resonant 
particles (PRPs) (Seashell Technologies, USA), Resonance light-scattering particles 
(RIP) or TiO 2 nanoparticles (Paunesku 2003). Depending upon their size and 
material, senaLconductor nanocrystals emit in different regions of tiie electromagnetic 

15 spectrum, even when excited with the same wavelength. Special coating procedures 
are applied to stabilize them in solution and make possible their conjugation with 
different objects. The advantage of nanocrystals is their high brightness of enaission, 
high stability against photobleaching and that they have narrow enoonision spectrums, 
which facilitates multi-plexing. Semiconductor nanocrystals, of various emission 

20 wavelengths, with surfaces coated with streptvidin or biotin are available form 

Quantum Dot Corp. The streptavidin-biotin interaction can be mediated in the vendor - 
suppUed incubation buffer. 

Nanoparticles can also be attached to a modified nucleotide via a thiol 
25 (sulfhydryl/mercaptan) group. Thiol groups can be attached to metals, in particular, 

gold. .^Jismatively. a linbrr can be uced ro attach the diiol to the nucleotide. The 
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is then depiotonated with 4-(dimethylaimno)pyridine, tendering the nanocrystal water 
soluble, and facilitating reaction with thiolated nucleotides. After DNA modification, 
the particles are separated fix>m unreactedDNA by dialysis or ultracentrifugation. 
SolubUsed nanocrystals. emitting at various wavelengths are commercially available 
e.g. from Evident Technologies. Under the relatively mild reaction conditions used for 
enzymatic extension there is no appreciable thiol exchange. 

As an alternative to linking the nucleotide to a nanoparticle through a thiol group, the 
binding pair linkage system described above can be used. The nucleotides can be 
attached to one member of a binding pair (e.g. biotin) through a cleavable Unker and 
the nanoparticle may coated with the otiier half of the binding pair, e.g. streptavidin. 
A photocleavable-Biotin-NHS reagent is commercially available from AmberGen 
which can react with amine groups m the nucleotides. A SNHS-SS-BIOTIN is 
available from Uptima and Pierce Biotechnology (EZ-Iink Sulfo-NHS-SS-Biotin) 
which can be attached to amines on DNA bases and is cleavable by a mild reducing 
agent. In addition Photoprobe biotin reagent is available from Vector laboratories 
which allows biotin to be linked to DNA by heat or UV exposure, with the option of a 
cleavable disulphide bridge within the linkage. Using such a procedure a fraction of 
the nucleotides may be inappropriately labelled, in a way that perturbes the correct 
function e.g. hydrogen bonding of the base whereas anotiier fraction may be correctly 
labelled. 

Nanoparticles used in this invention are seen as a diffraction limited point source of 
fluorescence just like a single fluorophore. The advantage over other single molecule 
sequencing by synthesis approaches is that the nanoparticle is much easier to detect 
than a single dye molecule. Therefore a low grade CCD camera can be used for their 
detection and illumination may be from a mercury arc lamp. Lower grade objective 
lenses may be used and oil immersion lenses are not necessary. Sophisticated set-ups 
for background eUmination, such as evanescent wave illumination may not be needed. 
Hence, the detection device is less sophisticated and cheaper than the instrument 
required for single dye molecule detection. Because a nanoparticle is easily 
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distinguished from artefacts and over background, sample preparation is easier and 
less stringent 

Nanoparticles can be easily removed after detection. A gold or CdSe^S Quantum 
5 dot can be removed by treatment with a mild reducing agent such as DTT or 

mercaptoethanoL The Au-S bond, although thermodynanodcally stable, is kinetically 
labile, leading to thiol exchange in the presence of appropriate thiol-containing 
molecules in solution, particularly at elevated temperatures. It is also possible to 
attach DNA to nanoparticles via a binding pair as decribed above, which would avoid 
10 the use of thiols altogether. The linker connecting the base to one of the binding pairs 
or the thiol may contain a cleavable bond such as a disulphide bond which can be 
removed using a mild reducing reagent. 

The nanoparticle may bear a positive charge which can interact with the nucleotide 
15 (Nakao 2003). The charged nanoparticle can be displaced by another charged species, 
following detection. 

There are two specific ways that nanoparticles can be used in the invention to achieve 
sequencing by synthesis. The first involves addition of nanoparticle in order to label a 
20 base after it has been incorporated. 

Thus in one embodiment the synthesis involves incorporation of a labelled nucleotide 
comprising:- 

25 a) Incorporation of an unlabelled nucleotide adapted for the attachment of a 
nanopardclei and 
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The second method by which nanoparticles can be used involves incorporation of 
nucleotides to which nanoparticles are already tethered. 

Thus in another embodiment the replacement of the labelled nucleotide, which is 
labelled by means of a nanoparticle, with an unlabelled nucleotide comprises 
removing the nanoparticle from said labelled nucleotide. The nanoparticle can be 
removed by cleaving the cleavable bond in the linker attaching the nanoparticle to the 
nucleotide. The cleavable bond may be cleaved by Ught if it is photocleavable, or by 
means of a mild reducing agent such as DTT or mercaptoethanol if it is a disulphide 
bond.. 

The use of nanoparticles means that additions of multiple nucleotides can be detected 
more easily because the increase in signal is not expected to be quenched and so a 
digital increase in signal intensity can be expected with increasing number of 
nucleotides. 

WO96/27025 discloses labelling nucelotides with microscopic beads in the context of 
sequencing by synthesis. The reagents used in the present invention are specificaUy 
nanoparticles of 20nm diameter and less as significantly larger beads would be too 
bulky to efficiently carry out the required molecular processes. 

The nanoparticle strategy of the present invention differs from WO96/27025 in that 
the nanoparticles are not only used for labeUing but may also be used to prevent the 
incorporation of more than one nucleotide at a time. 

In addition to detection of nanoparticles due to fluorescence, they can also be detected 
efficientiiy by electron microscopies or scanning probe microscopies (e.g. see Csaki . 
2001) 



Nanoparticles can also be used to prevent a second base from being added by steric 
hindrance or repulsion. The nanoparticles may have a polarity, which repels another. 
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For example they may be positively or negatively charged (Nakao 2003) or they may 
have a magnetic polarity or ^in (Lee 2003). 

The methods of the present invention can be carried out on an array. The term 
5 "array" as used herein relates to a spatially defined arrangement of one or more 
nucleotides in a pattern on a surface. The array can consist of individidual 
nucleotides present at at least 96, 384, 536, 10,000, 48,000 or 192,000 discrete 
locations on a surface. The array is preferably formed on a chip. 

10 The array can be a random array wherein the nucleotides are attached to the surface 
randomly. Alternatively the arrays can be spatially adressed. The nucleotides can be 
arranged in a grid pattern, with regular spacing between each nucleotide. The 
nucleotides can be located in a "spot" along with a plurality of other nucleotides of 
the same sequence. Altematively the arrays can comprise DNA colonies. 

15 

The polynucleotides can be attached either directiy or indirectiy to the surface. For 
example an enzyme, such as a ligase or polymerase, ustilised in the process can be 
attached to a solid surface. The enzyme then binds the target polynucleotide, thus 
anchoring it to the solid surface. 

20 

Altematively the polynucleotides can be capturted by oligonucelotides which are 
attached to the surface. The capture can be by hybridisation of a single stranded 
oligonucleotide to a single stranded target or a single stranded region of a double 
stranded target. Altematively, the polynucleotide or the surface immobilised capture 
25 probe, may comprise a sticlcy end or both may have a sticlcy end. The template and 
c imihecized ocrandcian Jbe. can.be ^ernxEiLsnlbi lin]re«1 fo-thf r.nrftc£±»y a ligation 
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nucleotides to fonn an array, and attaching nucleotides to an array are described in 
WO02/061126 and WOOl/57248. 

The surface is preferably glass, siUca or a polymer such as PMDS. The substrate is 
preferably a glass slide, coverslip, silicon wafer, microfabricated chip or multi-well 
plate, such as a flat bottomed optical grade 96 well plate. The polynucleotides may be 
attached to material that coats the surface. For example aminosilane coated surfaces 
supplied by Coming Inc (USA) or Asper Biotech (Estonia) can be used. The surface 
may be coated with a gel material including a sol-gel. The polynucleotides may be 
attached to beads, particles, or structures such as nanobars or nanorods which may 
contribute to the generation or modulation of a FRET signal. The surface ihay be 
metallized with for example silver.or gold particles to enhance a fluorescent or a- 
raman signal (Malicka 2003; Knedpp 1999). 

In addition, the surface or particles thereon may carry charge or be electrically biased 
of may be heated to control the sequencing process (Hamad-Schifferli, 2002). A 
charged surface is particularly useful to prevent non-specific interactions of 
nucleotides on the surface. Appropriate surface coatings include Polyethylamine as 
described by Braslavsky, 2003 and the DNA-bind sUde available from VBC 
Genomics (Austria). An electric field generated at the surface is a useful way for 
controlling the attraction and repulsion of nucleotides at the surface (Asanov 1998; 
SosnowsM 1997) 



Compared to the degree of parallelism currently available (96 Sanger sequencing 
reactions within individual capillaries, on a state-of-the-art DNA sequencer), a whole 
wafer high-density oligonucleotide array has the capacity to analyse 60 million 
reactions (e.g. see www.perlegen.com website). Until recently, the high cost of 
making individual photohthography masks meant that mefliods for making high- 
density oUgonucleotide arrays were only available for mass production of arrays an( 
were not accessible for the individual design of single arrays. However, the 
appUcation of digital micromirror technology to array synthesis has made it much 
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more straightforward and cheaper to specify^ individual arrays. A fully integrated 
benchtop device for making, hybridising to and analysing high-density arrays can 

streamline the entire array experiment to within one day, (e.g Geniom one;. . ... 

www.febitde). 

The current technology (e.g, Geniom One) uses digital micro-mechanical mirrors to 
create an array by the spatially-selective deprotection of photolabile protecting groups 
on DNA chains growing on a surface. Each new array design can simply and rapidly 
be specified by software and there is no need to make photolithography masks. Arrays 
can be made such that the sequencing can be initiated either with an array of 
oligonucleotides directed to specific regions in the genome or with an array of n-mers 
(Gunderson et al, 1998) which will inititate the process at any position which seeds 
hybridisation. Presently, the standard software is configured to synthesize 48,000 
oligonucleotides. However, by bypassing the software interface, it is possible to 
synthesize at least 192,000 sequences on one chip in one synthesis ran. All the 
synthesis, hybridisation and washing steps can be undertaken within the microfluidic 
channels of the chip provided by the manufacturer. 

In one embodiment the method is carried out using an array wherein multiple copies 
of one primer are located within a localised area. The combined signal from all the 
nucleotides incorporated is detected i.e. the "bulk" signal is detected. The signal 
detected will be that relating to the nucleotide that is incorporated the most, lii the 
embodiment wherein the nucleotides are all labelled identically and added 
individually, the strongest signal will be obtained when the nucleotide which is the 
correct base pair corresponding to the next base in the template is used. In the 
emhodiment-wherein the nucleotides ore difcexeiitiT}]]"/ l.?rr£]ledJLhe fjuorescant label 
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Alternatively Single DNA molecule ima^ng can be used to detect the template and/or 
incorporations as a point-source of fluorescence, for each individual molecule. . .. 

Molecules within arrays are distributed at a density at which substantially aU 
molecules are separated by a distance greater than that required for resolving them as 
separate entitites (defined by the diffraction limit of light). Then, instead of analysing 
a single intensity value due to the combined signal from thousands of molecules, a 
"digital" signal from each molecule can be individually assessed. This enables 
heterogeneous reactions within a microarray spot, which would ordinarily be masked 
by the signal averaging of ensemble methods, to be detected. - 



Determining which nucelotide has been incorporated in a single molecule aUows 
multiple copies of a polynucleotide to be sequenced individually, in parallel. The 
present invention enables, for example around 10^ sequence passes within a microaray 
spot. If a thousand copies of a polynucleotide can be sequenced at the same time, it is 
effectively equivalent to repeating the sequencing a thousand times. This considerably 
reduces the amount of time required to carry out this work, as compared to the 
traditional Sanger dideoxy techniques. It also provides increased confidence levels. 
This method eliminates the need for costly amplification steps, and.can be used to 
provide haplotype information. • 

The single molecule sequencing approach developed previously involves a "random- 
display of the diversity of molecules to be sequenced without any deliberate 
organization of the molecules by spatially addressable arrays. The methods of the 
invention can be appUed to such types of random arrays of single molecules. In such a 
set-up although there may be several other copies of the same sequence present 
elsewhere, at undefined positions on the surface it will be difficult to extiact statistical 
confidence in a sequence due to heterozygocity and the presence of other closely 
related sequences. Therefore, each molecule is essentially sequenced with one pass 
only. By contrast single molecule sequencmg within spatially addressable microarrays 
enables for example, arx>und 10^ molecules of the same species to be sequenced 
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within each array spot. Hence, flie confidence levels with which the sequence will be 
obtained will be unprecedented. If related but diff^ent sequences are captured within 
a_sppt, their identity will- become apparent aftenseveral cycles of base addition. Jf the 
sample is heterozygous, then the presence of two species within the spot will be seen. 

Although the methods of the present invention are preferably carried out on a solid 
surface, they can also be conducted on molecules which are free in solution for 
example in the wells of a microtitre plate or within micro or nano-scale vials, wells or 
structures Ql^vene et aT 2003). 

The methods of this invention are preferably undertaken on surfaces because it is 
easier to organise and/or monitor reactions on a surface than reactions freely diffusing 
in solution. However, wheri the molecules are inamobilised on beads which are able to 
diffuse, one can take advantage of the improved reaction kinetics of solution phase 
reactions. A sequence strategy has been described for molecules immobilised on 
beads (Brenner et al 1999) and the methods of the present invention coidd be applied 
on this platform. 

As the invention can be applied in a single molecule detection mode it is very 
sensitive and can be performed on small amounts of sample material. Hence the 
• invention can-also be applied in a context where one or very few molecules are • - 
available, such as from ancient DNA or a forensic sanq)le. 

Although the invention can be carried out on a purified firagment of a polynucleotide, 
it offers particular advantages for sequencing poljoiucleotides directiy from a complex 



21 

in different fractions. This is needed particularly when the invention is implemented 
in bulk mode. When the invention is implemented in single molecule mode the 
identity, of each individual molecule is fuUy detennined regardless and hence a 

contaminating sequence can be identified. However to avoid unnecessary sequencing 
5 repetitive DNA can be suppressed by for example subtraction with Cot-IDNA. The 
invention can be implemented on spatially addressable arrays so that different regions 
of tiie genome or different species in a mRNA population are captured at specific 
known locations. One advantage of this is tiiat capture probes provide a certain length 
of sequence information even before the sequencing by synthesis data is obtained. 

10 

Methods of this invention can be carried out in heterogeneous way, where reaction 
. components for the different steps of tiie reaction are provided at separate stages. The 
methods can also be carried out in a "homogeneous" way, whare all the components 
required for the reaction are provided in the reaction vessel from the start. Then 
15 cyclical electix)magnetic modulation, for example for cleaving a linkage provides a 
clocking mechanism for shifting the sequence re^ster. Furthermore, some of the 
methods of this invention can be carried out in real-time, by providing reaction 
components and then continuously monitoring the reaction. PrefCTably for fliis 
embodiment the signal is detected by a FRET mechanism as described below. 

20 

Sequencing by FRET 

In fluorescence resonance energy tiransfer (FRET), a donor fluorophore molecule 
absorbs excitation energy and delivers this via dipole-dipole interaction to a nearby 
acceptor fluorophore molecule (Stryer, L. andHaugland, R.P. 1967.). Huorescence 

25 resonance energy transfer can be used to cut out background fluOTescence in single 

molecule experiments OBraslavsky et al 2003). Recentiy. a new way of using FRET in 
a DNA assay, termed iFRET has been introduced in which the donor dye is an DNA 
intercalating dye that is used to stain DNA (Howell WM et al. 2002). iFRET is 
reported to give fluorescence values that are 2.5 times greater than those obtained 

30 from the intercalating dye alone, and more than 40 times greater than those from 

conventional FEIET. It is suggested that the reason for the difference may be that the 
iFElET system involves the channelling of an accumulation of energy from a chain of 
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donor dye molecules (in contrast to a single donor in the FRET system) into the 
acceptor moiety, which is then able to re-emit energy unhindered. Double-strand, 
?DNA-specificintercalatingxiye (e.g.,.SYBR Green I) has been used as a-FRET.dono^,- 
with a conventional FRET acceptor. 

A FRET mechanism can be implemented with the sequencing by synthesis methods 
described in this invention. One embodiment of the present invention involves the 
detection at the single molecule level, using FEIET between two or more FEIET 
partners. The FRET partnership system comprises two or more partners each attached 
to a reaction component selected ficom the group comprising nucleotide, the template, 
the polymerasing agent or any other reagent involved in the polymerization reaction. 
Donor-acceptof fluofophore pairs are chosen so that the emission spectrum of the 
donor overlaps with the excitation spectrum of the acceptor; many different 
combinations of available fluorescent labels can be used. 

In one preferred embodiment the FRET means of detection is utilised in a method 
wherein the labelled nucleotide is detected as it approaches the target DNA molecule. 
As the labelled nucleotide is brought into the proximity of the target polynucleotide 
during polymerisation, the FRET reaction occurs between the label on the nucleotide 
and a FRET partner. This reaction can be detected. The FEIET label is attached to the 
nucleotide through the beta or gamma phosphate groups. These phosphate groups are 
removed as the nucleotide is added during extension, so effectively the detection of 
the label, the extension, and the replacement of the labelled Jiucleatide-.with-.an„ 
unlabelled nucleotide occur almost simultaneously. When flie nucleotide has been 
incorporated it is no longer labelled. The released-pyrophosphate is firee to diffuse out 
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In another preferred embodiment FRET occurs betwe«i a DNA stain (e.g. an 
intercalating dye) bound to DNA and one or more FRET partners attached to another 
polymerisation reaction conq)onent such as the nucleotide or polymerising reagent. ..... 

The bound DNA stain may act as FRET donor or acceptor. Jt is simple to add a DNA 
stain that incorporates at multiple positions along a template molecule so that it can 
contribute to a FRET reaction anywhere along the extending chain. One of the FRET 
partners may be the fluorescently labelled nucleotide, which is utilised to extend the 
polynucleotide being synthesised. The fluorescent label may be directiy or indirectly 
attached to nucleotide, and it may be a nanoparticle. Preferably the DNA stain is not 
the first FRET Donor as this could lead to it's wholesale photobleaching; although 
this can be minimised with judicious choice of antif ade composition. Several DNA 
stains are available for staining double-stranded DNA and a few of these are also able 
to stain single-stranded DNAs relatively efficiently, e.g. SYBR Gold. 

Alternatively, or additionally the FRET partner can be attached to the polymerase, for 
example the DNA polymerase. The FRET label may be in the form of a 
semiconductor nanocrystaVQuantum Dot, as these do not photobleach which is 
important as it is desirable to retain the same polymerase throughout synthesis. 

Multiple FRET interactions can take place when the excitation and emission 
spectrum for FRET partners overlap. The first FRET partner is excited at one 
wavelength, and it's emission wavelength overlaps with the excitation wavelength for 
the second a FRET partner. The second FRET partner has an emission wavelength 
which overlaps with the excitation wavelength for a third FRET partner. In this way a 
chain of energy transfras can take place, when the FRET partners are within.FRET 
range and the first donor has been excited. This can result in a large stokes shift i.e. 
large separation of excitation from emision. This aUows the signal to be read at a 
wavelength far removed from to the original excitation wavelength, which is 
advantageous for eliminating bleed-through from the excitation source into the 
detection channel. Importantly, this method also ensures that aU of the components 
(the target polynucleotide, the labelled nucleotide and the polymerase) are aU in close 
proximity. In some instances an anti-stokes shift may be utiUsed. 
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Howell et al describe a system in which the intercalating dye acts as donon A Single 
-moleeide-system-naay-involve-§yber-green-l-as-tbe-d^ 

as the acceptor. As Quantum Dots can be excited at various wavelengths, when they 
are used as the acceptor, donors emitting at various wavelengths can be used, e.g. 
DAPI or S YBR gold Alternatively, the incorporated fluorescent nucleotide or a 
fluorescent nanoparticle can act as the donor and an intercalating dye such as P0-P03 
can be used as the acceptor (Nakayama et al 2003). The Quantum Dot can be excited 
at a wavelength far removed from the acceptor dye. The signal produced would be 
due to the localised excitation by the Quantum Dot of a few fluorescent dyes in its 
locality. Following detection of the FRET signals, an image of the polynucleotide 
polymer can be taken by exciting the DNA stain directiy. The FRET signal can then 
be supCTinaposed on the polynucleotide polymer image, to determine where 
incorporation has occurred. 

Because energy transfer to the acceptor is from a highly localised source, background 
fluorescence from anything beyond die FRET range, which is about lOnm, does not 
contribute to background fluorescence. Hence FEIET would enable reactions to be 
monitored continuously without the need for washing away of unboxmd fluorescent 
dyes or nanoparticles. This would enable addition of more than one nucleotide to be 

detected in real-time. The system could be homogeneous in that all that was needed 

for the reaction can be placed in the reaction vessel at the start of synthesis. It would 
be desirable to retain some form of agitation or mixing of the reaction solution to 
enable pyrophosphate to diffuse out of FRET range after it has been released. 

lip, n^rrn^TtTi*-^^- ^ nth i H.=: \n one ^SypeH the i^^ran Hrr* rr^-^Arlp^i r m.^rhr^H nf 
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Preferably this melhod is used to carry out teal time monitoring of the sequence. 

The polymerisation reaction components include the polymerizing reagent.and the ., 
template polynucleotide. Preferably a DNA stain is used to label the template and a 
Quantum Dot is used to label the polymerizing agent. 

A nucleotide that may be temporarily resident within the FRET range of a 
polymerasing agent or a template molecule, may or may not get incorporated 
depending on whether it is the correctly matched nucleotide for the position in 
question. This temporary resident of the FRET locality must be distinguished from a 
nucleotide that is actually incorporated. This can be done by utiUsing information 
gathered prior to the reaction about, for example, the longevity or strength of the 
FRET signal depending on whether it originates from a nucleotide temporarily 
resident within the FRET locality or a properly incorporated nucleotide. WOOO/36151 
describes a mechanism in which the dye attached to the nucleotide remains quenched 
by a quencher until incorporation of the nucleotide occurs at which point the quencher 
becomes detached and aUows the dye to fluoresce freely. The drawback of this 
approach is that there is Ukely to be loss of quenching which is not due to loss of 
quencher but is due to thermal or stractural fluctuations or photobleaching. An 
alternative way of measuring incorporation in the context of the present invention is 
by detectmg quenching/de-quenching or preferably a wavelength shift witli a FRET 
partner which occupies a different reaction component than the nucleotide itself. For 
example, the emission due to the FRET partner on the template may be modified by a 
FRFT Partner on the beta or gamma phosphate of the nucleotide. When the nucleotide 
is incorporated and pyrophosphate is released, the FRET interaction is aboHshed and 
. hence a fluorescence property of the FRET partner on the template is modified, eg, it 
emits fluorescence at a shifted wavelength. The first donor in this scheme may be a 
Quantum Dot attached to the polymerizing agent and the whole process may be 
designed to have multiple FRET interactions which are able to be monitored in real 
time, by using for example an image spUtter such as the Quad-view from Optical 
Insights Inc. (USA). 
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Appropriate anti-fades can be used to attenuate photobleaching. This can include the 
provision of Oxygen scavengers and reducing agents such as DTT and 
— -r:— — M^rcaptoeflianQl-t . '■■ . — -r-zr-r-. — ; . • .- — 

5 Linear Polymer Display 

Genomic sequence would have much greater utility if haplotype information (the 
association of alleles along a single DNA molecule derived from a single parental 
chromosome) could be obtained over a long range. This is possible by combining the 
SbS process of the present invention with the sin^e molecule display of linearised 
. 10 genomic DNA described in WO02/074988. Here each template molecule is sorted on 
the array, and combed out to provide a linear display of sequence along its length. 
Polymerisation can then be seeded at multiple positions on each linear molecule, e.g. • 
optimised to be every lOkb apart. The incorporations are monitored as slowly 
migrating point sources of fluorescence along the linearised DNA polymer. The 
15 introduction of nicks in the double stranded DNA is sufficient to prime synthesis. 
DNA stains such as S YBR Gold can be used to visualize die DNA polymer. This 
method can be used to confirm that it is the template polynucleotide that is being 
sequenced, as opposed to some other contaminant. 

20 Resolving ambiguities in the sequence 

If the sequencing approaches are carried out in a microarray format and the array 
making and sequencing is iterated, then ambiguities in the number of bases at any 
particular position can be resolved by makmg probes that would address each of the 
suspected sequence possibilities in the next array synthesis. 

25 
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Sequencing to obtain gene expression information 

The method of the present invention can be adapted to obtain gene- expression date,,:. 
particularly from a single cell. Once a certain length of sequence information has 
been obtained, it can be used to identify the mRNA species. The method can be 
modified for sequencing mRNA. Thus in one embodiment the target polynucleotide 
comprises mRNA. The mRNA can be hybridised to primers which are designed to 
hybridise to any mRNA molecule. For example, primers can be designed to hybridise 
to all sample mRNA species at a specific point in the mRNA primary structure. This 
point could be the polykdenylation signal. AAUAAA, the Poly A taH at the 3' end or 
the cap structure at the 5' end or a specific sequence clamped onto the 5' or 3' end- 
Preferably the primers are attached to a solid surface, and more preferably form an • 
array. 

Thus in one aspect the present invention provides a method of sequencing mRNA 
comprising: 

a) contacting an array of probes designed to hybridise to mRNA molecules with 
a sample of mRNA under conditions whereby the mRNA will hybridise to 
said probes; and 

b) sequencing said mRNA utilising a method as descibed herein 

Co-sequencing two samples to find the differences between them 

The DNA or mRNA fiom two or more individuals or populations can be compared by 
differentially labelling each template (i.e. labelling the template with a different label 
for each population or individual), inmiobilismg them on a surface and thrai 
sequencing them simultaneously. The templates can be labelled, for example by •• 
attaching an oUgonucleotide containing different fluorescent dyes by using RNA 
ligase. 

The templates can be immobiUsed by attaching the labelled nucleotides to a surface as 
described above. The templates can be used to form an array. Alternatively the 
templates can be captured on to an array. This can be done for example, if the 
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templates nucleotides are allowed to hybridise to primers which themselves fomi an 
array. 



After, the templates are immobilised on the surface, the foci for each sample will be 
5 detected and recorded Following this, the label can be photobleached and the 
sequencing can commence. 

Detection sdiemes and instrumentation 

The images of the polynucleotides are projected onto the array of a Charge-couple 
10 device {CCD) camera, from which they are digitized and stored in memory. The 
images stored in memory are then subjected to image analysis algorithms. These 
algorithms can distinguish signal from background, monitor changes in signal 
characteristics, and perform other signal processing functions. The memory and signal 
processing may be performed off-line on a computer, or in specialized digital signal 
15 processing (DSP) circuits controlled by a microprocessor. 

When the base-by-base incorporation of labelled nucleotides is monitored on 
molecules in bulk (i.e. the combined signal from a population of polynucleotides 
within one spot on an array is measured), the established methods for scanning 
20 microarrays can be used to monitor incorporation at microarray spots after each base 
addition. For example a Genepix"scaimer(Axon instruments) or a Scanarray 
(Packard) which can be linked to four different laser lines can be used. 

When individual molecules within the microarray spot are analysed directly, then 
5 wide iBdd CCD imaging is used. CCD imaging enables a population of single 
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view 
noise 



magnification of the objective, any magnification due to the C-mount and, the size 
and number of pixels of the CCD chip. TypicaUy, a microatray spot can be viewed by 
either a 40X or 60X objective depaiding on CCD camera and C-jnount. ThOTefpre to . . , 
large regions of a slide (several cm^) multiple images must be taken. A low 
high sensitivity camera is used to capture images. There are several camera 
models that can be used; Cooled Micromax camera (Roper scientific) controlled by 
MetaMorph (also MetaView software; both from Universal Imaging). MetaMorph can 
be run on a Dell OptiPlex GX260 personal Computer. 

Microatray spot-finding and single molecule imaging within microarray spots 
MetaMoiph's optional microairay module and a low magnification objective can be 
used to locate spots before taking a CCD image of each of the spots using higher 
magnification. 

As the signal ftom the spots containing singly resolvable molecules is very low under 
low magnification, a marker dye, which emits at a different wavelength to ttie sample 
emission can be included in the spots to help locate them. The objectives need to be of 
high numerical aperture (NA) in order to obtain good resolution and contrast The 
integration of an autofocusing capabiUty within the procedure to maintain focus as the 
sUde is scanned, is usefiil especially when Total Internal Reflection Fluorescence 
microscopy (TIRF) is employed. Software can be used to control Z movement 
(integral to motorized microscopes) for the purpose of autofocusing. Images of 
microairay spots can be obtained by x-y movements of the sample stage (e.g. using 
Prior Scientific's Proscan stage under MetaMoiph control). To avoid photobleaching 
it is advisable to use a shutter (e.g. fix>m Prior Sdentific) to block off illumination 
. while moving from one spot to another, A controUer can be used to control X-Y . • 
stage, the filter wheels and shutter, (eg Prior Scientific ProScan). 

Once the spots are found, their coordinates are recorded by the software controlling 
the instrument and then after each base addition, a CCD image is taken of each spot of 
flie microairay. 
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In addition to the instrument being used for looking at a microarray where template 
molecules have been captured by probes, a large number of samples can be gridded 

{a'S"Onicraarra^ 

samples may be individual nucleotide populations or a set of differentially labeled 
nucleotide populations. 

Two imagmg set ups, Total Internal Reflection Fluorescence microscopy (TIRF) and 
epi-fluorescence naicroscopy have been used. 

Epifluorescence Microscopy 

linages of single molecules labeled with a single dye molecule can be obtained using 
a standard epi-fluorescehce microscopy set up, Using high numerical aperture (NA) 
objectives and a high grade CCD camera. However, the image can be hazy. In order 
to obtain a clearer image it is preferable to use deconvolution software to remove the 
haze. Deconvolution modules are available as drop-ins for MetaMoiph software. 
When the single molecules are labeled with nanoparticles the camera and objectives 
may be of a lower grade and oil-inraiersion objectives may not be required. 

Total Internal Reflection Fluorescence Microscopy (TIRF) 

TIRF enables very clean images to be obtained, for example using off the shelf 

systeni for Objectivb style TIEIF (such as those produced by Olympus or Nikon). A 

full description can be found in the brochure at the following website: www^nikon- 

instrumentsxom/uk/pdf^rochure-tirf.pdf 

Objective style TIRF can be used when the sample is on a coverslip. However, it is 
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Multi-colour single molecule imaging 

When the sequencing strategy involves the sequential addition of each of the four • „ • 
nucleotides all labeled with a single fluorophore such as CyS, then a single CCD 
image is taken after each base addition. However, if each nucleotide is differentially 
labeled (i.e. each nucleotide type is labelled with a different fluorophore) and added 
simultaneously, then the signal from each of the differerent fluorophores needs to be 
acquired distinguishably. This can be done by taking four separate images by 
switching excitation/emission filters. Alternatively, an image (Wavelength) spUtter 
such as the Dual View (Optical Insights, Santa Fe. NM) or W View (Hamamatsu, 
Japan) which direct the Ught through two separate bandpass filters with Utde loss of 
Ught between them; can be used for imaging two different wavelenghts onto different 
portions of a CCD chip. Alternatively the Ught can be spUt into four wavelengths and 
sent to the four quadrants of a CCD chip (e.g Quad view from Optical Insights). This 
obviates the need to switch filters using a filter wheel. A MetaMorph drop-m for 
single image dual or midti-emission optical splitters can also be employed. Lnage 
splitting can be used to monitor FRET. 

Monitoring sequencing on single molecules randomly distributed on a surface 
As an alternative to microarray spot finding prior to single molecule imaging and for 
implementations where the single molecules to be analysed are not organised within 
the spatially addressable microarray spots, a series of images of the surface can be 
taken by x-y translation of the sUde. A super-wide field image is then composed by 
stitching each of the images together. This process can be automated to form a high 
throughput system, utilising computer software to control the process. Such a process 
- could include robotic sample preparation. A small desktop apparatus with an 
integrated biosensor could be developed. 

Where real-time sequencing is carried out, translation of sample with respect to CCD 
camera may be too slow to detect each molecular event. Therefore a metiiod for 
coUecting single molecule data on a surface by taking images simultaneously with an 
array of CC3D chips can be applied. 
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This methods of this invention may aid functional genomics by enabling phenotype- 
. genotype-eeH©lations-t© b^-made-at-a-hi:^TrQ5otodQnd\Iassive^^^^ . 
would enable fine-scale genotype-phenotype correlations to be made for biological 
studies, improve methods for determining the functional contribution of genes to 
diseases and be the basis for developing personalized medicines which are tailored to 
a persons functional genetic make-up. 
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CLAIMS 

1, A method of sequencing a target polynucleotide comprising the steps of: 

(a) Carrying out template derived nucleotide synthesis utilising a labelled 
nucleotide; 

(b) detecting the presence or absence of said labelled nucleotide; 

(c) replacing said labelled nucleotide with an unlabelled nucleotide; and 

* 

(d) repeating steps a) to c) 

with the proviso that if said labelled nucleotide is labelled with a label directly 
attached to the nucleotide, then ttie replacement of said labelled nucleotide 
comprises removal of the whole of said labelled nucleotide and replacement 
with an unlabelled nucleotide, and only said labelled nucleotide can be 
removed. 

A method as claimed in claim 1 wherein said target polynucleotide is attached to a 
solid surface. 

A method as claimed in claim 1 or claim 2 wherein said labelled nucleotide is 
labelled with a fluorescent tag. 

A method as claimed in claim 3 wherein said fluorescent tag is attached direcfly to 
said nucleotide. 

A method as claimed in claim 4 wherein step (c) comprises removal of said 
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7. A method as claimed in claim 6 wherein said nanoparticle is a semiconductor 
nanocrystal. 



8. A method as claimed in any one of claims 3. 6 or 7 wherein said fluorescent tag or 
said nanoparticle are attached to said labeUed nucleotide by a linkage. 

9. A method as claimed in claim 8 wherein said linkage comprises a cleavable bond. 

10. A method as claimed in claim 8 or claim 9 wherein said linkage comprises a 
binding pair. 

11. A method as claimed in any one of claims 7 to 10 wherein step (b) comprises 
incorporation of an unlabelled nucleotide ad^ted for the attachment of a 
nanoparticle; and attaching said nanoparticle to said unlabelled nucleotide. •; 

12. A method as claimed in any one of claims 7 to 11 wherein step (c) comprise? 
removing the nanoparticle ftom said labelled nucleotide. ■> 

13. A method as claimed in claim 12 wherein said nanoparticle is removed from said 
labelled nucleotide by cleaving the cleavable bond in the linkage attaching said 
nanoparticle to the nucleotide. 

14. A method as claimed in claim 13 wherein said linkage attachs one member of a 
binding pair to the nucleotide, and the other member of the binding pair is 
attached to said nanoparticle. 

15. A method as claimed m claim 6 wherein said linkage comprises a binding member 
attached by a cleavable bond to said nucleotide and the other binding member is 
attached to said fluorescent tag. 

16. A method as claimed in claim 15 wherein step (d) comprises removal of said 
fluorescent tag by cleaving said cleavable bond. 
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... 17,- A method as claimed in anyone of claims 1 to 3 wherein step (b) is carried out by 

means of an imaging technique utilising FRET (fluorescent resonance energy 
5 transfer). 

18. A method as claimed in claim 17 wherein said target polynucleotide is treated 
with a DNA intercalating dye. 

10 19, A method as claimed in claim 17 or claim 18 wherein said labelled nucleotide is 
labelled with a label that acts as a FEUBT partner. 

20. A method as claimed in anyone of claims 17 to 19 wherein steps (a)-(c) occur 
simultaneously, 

15 

21. A metihod as claimed in any one of claims 1 to 20 wherein said target 
polynucleotide forms part of an array. 

22. A method as claimed in claim 21 wherein step (c) comprises measuring the signal 
20 generated by a plurality of said labelled nucleotides. 

23. A method as claimed in claim 21 wherein step (c) comprises detecting the 
presence or absence of said labelled nucleotide for each individual polynucleotide. 

25 24. A method as claimed in claim 23 wherein said detection is carried out by means of 
single DT T A rnol 3cule im;2ging. 
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27. A method as claimed in claim 26 wherein the label on said labelled nucleotide acts 
as a FRET partner to said DNA intercalating dye. 

28. A method of comparing two or more polynucleotide sequences comprising: 

a) differentially labelling the nucleotide sequences being compared; 

b) immobilising said nucleotide sequences on a surface; 

c) detecting the locus of each nucleotide sequence; and 

d) sequencing said polynucleotide sequences using a method as claimed in any of 
claims 1 to 27. 

29. A method as claimed in claim 28, further comprising photobleaching th^^ label - 
prior to the sequencing of said polynucleotide sequence. 

30. A method of resolving ambiguities in a polynucleotide sequence comprising: 

a) identifying an area of ambiguity in a polynucleotide sequence; J 

b) designing probes for each of the suspected sequence possibility; and ^) 

c) utilising the primers formed to sequence said polynucleotide sequence 
utihsing a method as claimed in any of claims 1 to 27, 

31. A method of sequencing mRNA comprising: - • 

a) contacting an array of probes designed to hybridise to mRNA molecules with 
a sample of mRNA under conditions whereby the mRNA will hybridise to 
said probes; and 

b) sequencing said mRNA utilising a method as claimed in any one of claims 1 to 
27. 

32. A method as claimed in 31 wherein said probe is designed to hybridise to the 
polyadenylation signal, 5' cap, 3' tail or the poly A tail. 

33. A method of sequencing a target polynucleotide comprising the steps of: 
(a) treating said target polynucleotide with an intercalating dye; 
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(b) extending a primer annealed to said target polynucleotide utilising a nucleoticie 
labelled with a label which acts as a FRET partner to said DNA intercalating 

„ dye;. ~ 

(c) detecting the presence or absence of said nucleotide by naeans of an imaging 
technique that utilises FRET; and 

(d) repeating steps a-c; 

wherein steps (a) and (b) can occur in any order. 

34. A method of sequencing a target polynucleotide comprising the steps of: 

(a) extending a primer annealed to said target polynucleotide utilising a labelled 
nucleotide wherein the label is directly attached to the nucleotide; 

(b) detecting the presence or absence of said labelled nucleotide within said 
extended primer; 

(c) removal of said labelled nucleotide, and replacement of said labelled 
nucleotide with an unlabelled degradation resistant nucleotide; and 

(d) repeating steps a-c; 

wherein the 3' end of said primer comprises at least one degradation resistant 
nucleotide. 

35. A method of sequencing a target polynucleotide comprising the steps: 

(a) extending a primer annealed to said target polynucleotide utilising a labelled 
nucleotide wherein the label is attached to the nucleotide via a cleavable 
linkage; 

(b) detecting the presence or absence of said labelled nucleotide within said 
extended primer: 

( c) clea\dng saidJabel fronrisaid nucleotide: and. 
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(b) contacting said nucleotide with a label attached to the other member of a 
binding pair under conditions such that the two members of the binding pair 
bind to one another, . 

(c) detecting the presence or absence or said label; 

(d) removal of said label and said binding pair by cleaving said cleavable Unkage; 
and 

(e) repeating steps a-d. 

37. A method of sequencing a target polynucleotide, comprising the steps of: 

(a) carrying out template derived polynucleotide synthesis utilising a nucleotide 
labeUed with a Fret partner and at least one other polymerisation reaction 

. . component labelled with a Fret partner; 

(b) detennining the nucleotide incorporated by detecting Fret interactions; and 

(c) repeating steps (a) and (b). .? 
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